@joseph-isaacs
Contributor

No description provided.

@joseph-isaacs joseph-isaacs changed the title from "feat[cuda]: to_host canonical" to "feat[fuzz]: cuda kernel fuzzer" Jan 23, 2026
@codspeed-hq

codspeed-hq bot commented Jan 23, 2026

Merging this PR will degrade performance by 84.24%

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚡ 11 improved benchmarks
❌ 8 regressed benchmarks
✅ 1246 untouched benchmarks
⏩ 1294 skipped benchmarks¹

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

| Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|
| WallTime | u64_values_u8_codes[10M] | 260.3 µs | 222.8 µs | +16.84% |
| WallTime | u8_FoR[10M] | 8 µs | 50.9 µs | -84.24% |
| Simulation | canonical_into_non_nullable[(10000, 1, 0.0)] | 36.2 µs | 25.8 µs | +40.31% |
| Simulation | canonical_into_non_nullable[(10000, 1, 0.01)] | 41.1 µs | 32.3 µs | +27.31% |
| Simulation | canonical_into_non_nullable[(10000, 10, 0.1)] | 471.6 µs | 382.2 µs | +23.4% |
| Simulation | canonical_into_non_nullable[(10000, 1, 0.1)] | 57 µs | 48.1 µs | +18.63% |
| Simulation | canonical_into_non_nullable[(10000, 10, 0.0)] | 278.9 µs | 195.6 µs | +42.57% |
| Simulation | canonical_into_non_nullable[(10000, 10, 0.01)] | 306 µs | 222.7 µs | +37.45% |
| Simulation | into_canonical_non_nullable[(10000, 1, 0.1)] | 55.3 µs | 63.7 µs | -13.17% |
| Simulation | into_canonical_non_nullable[(10000, 1, 0.0)] | 33 µs | 41.1 µs | -19.65% |
| Simulation | into_canonical_non_nullable[(10000, 10, 0.01)] | 310.4 µs | 227.8 µs | +36.24% |
| Simulation | canonical_into_nullable[(10000, 100, 0.0)] | 4.3 ms | 4.9 ms | -12.51% |
| Simulation | into_canonical_non_nullable[(10000, 1, 0.01)] | 39.2 µs | 47.1 µs | -16.92% |
| Simulation | into_canonical_non_nullable[(10000, 10, 0.0)] | 283.5 µs | 200.2 µs | +41.58% |
| Simulation | into_canonical_non_nullable[(10000, 10, 0.1)] | 472.8 µs | 383.7 µs | +23.24% |
| Simulation | into_canonical_nullable[(10000, 10, 0.0)] | 456 µs | 536.8 µs | -15.06% |
| Simulation | into_canonical_nullable[(10000, 10, 0.1)] | 716.9 µs | 627.6 µs | +14.23% |
| Simulation | into_canonical_nullable[(10000, 100, 0.0)] | 4.3 ms | 5 ms | -14.39% |
| Simulation | into_canonical_nullable[(10000, 100, 0.1)] | 6.1 ms | 6.9 ms | -11.97% |
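
For readers checking the numbers: the Efficiency column above appears consistent with BASE / HEAD - 1, expressed as a percentage. This formula is inferred from the reported values, not taken from CodSpeed's documentation, and the helper name `efficiency_pct` is hypothetical. Because BASE and HEAD are shown rounded, recomputed values can differ from the reported ones in the last digit. A minimal sketch:

```python
def efficiency_pct(base: float, head: float) -> float:
    """Relative change of HEAD versus BASE, in percent.

    Assumed formula (inferred from the report): base / head - 1.
    Positive means HEAD is faster, negative means HEAD is slower.
    Both arguments must use the same time unit.
    """
    return (base / head - 1.0) * 100.0


# Timings copied from the report, in microseconds.
rows = {
    "u64_values_u8_codes[10M]": (260.3, 222.8),
    "canonical_into_non_nullable[(10000, 1, 0.0)]": (36.2, 25.8),
    "into_canonical_non_nullable[(10000, 1, 0.1)]": (55.3, 63.7),
}

for name, (base, head) in rows.items():
    print(f"{name}: {efficiency_pct(base, head):+.1f}%")
```

Under this reading, the -84.24% headline for u8_FoR[10M] corresponds to HEAD taking roughly six times as long as BASE (8 µs versus 50.9 µs), measured in WallTime mode on a hosted runner that the report itself flags as producing inconsistent data.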

Comparing ji/fuzz-arb-gpu (6c8c9e8) with develop (a66a58d)

Footnotes

  1. 1294 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports.

@joseph-isaacs joseph-isaacs requested a review from 0ax1 January 26, 2026 13:08
@joseph-isaacs joseph-isaacs marked this pull request as ready for review January 26, 2026 13:08
@joseph-isaacs joseph-isaacs added the changelog/chore A trivial change label Jan 26, 2026
@joseph-isaacs joseph-isaacs enabled auto-merge (squash) January 26, 2026 13:09
@0ax1
Contributor

0ax1 commented Jan 26, 2026

@claude review

@claude
Contributor

claude bot commented Jan 26, 2026

[gone]

@joseph-isaacs joseph-isaacs requested a review from 0ax1 January 26, 2026 14:09
@0ax1 0ax1 disabled auto-merge January 26, 2026 17:21
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
@0ax1 0ax1 force-pushed the ji/fuzz-arb-gpu branch 5 times, most recently from c39fc2a to 1947754 Compare January 26, 2026 21:19
0ax1 added 3 commits January 26, 2026 21:23
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
@0ax1 0ax1 merged commit 1ba9c80 into develop Jan 26, 2026
45 of 47 checks passed
@0ax1 0ax1 deleted the ji/fuzz-arb-gpu branch January 26, 2026 21:51
Labels

changelog/chore A trivial change

3 participants